Vehicle unlocking method, system, and apparatus, vehicle, and storage medium

ABSTRACT

Example vehicle unlocking methods and apparatus are described. In one example method, wakeup keyword detection is performed on an obtained voice signal. If the voice signal includes a wakeup keyword, an acoustic source direction is determined, and a target camera is indicated to collect image data. The target camera is a target camera that is installed on the vehicle and whose angle of view can ensure that the acoustic source direction is captured. Then, the vehicle is unlocked based on the image data collected by the target camera.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No.PCT/CN2020/077342, filed on Feb. 29, 2020, the disclosure of which ishereby incorporated by reference in its entirety.

TECHNICAL FIELD

This application relates to the field of vehicle technologies, and inparticular, to a vehicle unlocking method, system, and apparatus, avehicle, and a storage medium.

BACKGROUND

Currently, vehicles have become a main transport tool for people totravel, and people may unlock the vehicles by using car keys, toimplement a transport function and the like of the vehicles. However, avehicle cannot be unlocked when a car key is not present or the car keyis locked inside the vehicle. Currently, in addition to the car key, thevehicle may be unlocked by connecting an intelligent terminal to anin-vehicle infotainment. For example, a mobile phone is connected to thein-vehicle infotainment to replace the car key, to unlock the vehicle.Similarly, if the mobile phone is not present, or if the mobile phone isalso locked inside the vehicle, the vehicle cannot be unlocked, either.Therefore, there is an urgent need for a key-free vehicle unlockingmethod, in other words, no physical key is used.

In a related technology, a user may enter a password into a coded lockdisposed outside a vehicle body, to unlock a vehicle without a key.However, in an unlocking process using a password, the user needs tomanually enter a password after reaching a position of the coded lock,entire unlocking duration is relatively long, and security of unlockingusing a password is relatively low.

SUMMARY

This application provides a vehicle unlocking method, system, andapparatus, a vehicle, and a storage medium, to improve a speed andsecurity of a key-free vehicle unlocking method. The technical solutionsare as follows:

According to a first aspect, a vehicle unlocking method is provided,where the method may be applied to an in-vehicle infotainment includedin a vehicle, and the method includes:

obtaining a voice signal; determining an acoustic source direction basedon the voice signal if the voice signal includes a wakeup keyword, wherethe wakeup keyword is used to indicate that a vehicle currently needs tobe unlocked; indicating, based on the acoustic source direction, atarget camera to collect image data, where the target camera is a camerathat is installed on the vehicle and whose angle of view can ensure thatthe acoustic source direction is captured; and unlocking the vehiclebased on the image data collected by the target camera.

In this application, a microphone is installed on the vehicle, and themicrophone may collect a voice signal in an environment, and transmitthe collected voice signal to the in-vehicle infotainment.

It should be noted that the microphone is a low power consumptionapparatus. Therefore, the microphone may be in a working state at alltimes, to collect the voice signal in the environment in real time.

In this application, after the voice signal is obtained, wakeup keyworddetection may be performed on the voice signal, where the wakeup keywordis used to indicate that the vehicle currently needs to be unlocked. Ifit is detected that the voice signal includes the wakeup keyword, it maybe determined that the wakeup keyword detection succeeds.

Optionally, in this application, to further improve security, after thewakeup keyword detection is performed on the obtained voice signal,voiceprint feature matching may be further performed, and after thevoiceprint feature matching succeeds, the acoustic source direction isdetermined. To be specific, the determining an acoustic source directionbased on the voice signal if the voice signal includes a wakeup keywordincludes: if the voice signal includes the wakeup keyword, extracting avoiceprint feature in the voice signal; and if the extracted voiceprintfeature matches any stored voiceprint feature, determining the acousticsource direction based on the voice signal.

Optionally, the indicating, based on the acoustic source direction, atarget camera to collect image data includes: selecting, based on theacoustic source direction and from a plurality of cameras installed onthe vehicle, a camera whose angle of view can ensure that the acousticsource direction is captured, and determining the selected camera as thetarget camera; and indicating, based on the acoustic source direction,the target camera to collect the image data.

In this application, after determining the acoustic source direction,the in-vehicle infotainment may control the target camera to be turnedon to collect the image data. The target camera is a camera that isinstalled on the vehicle and whose angle of view can ensure that theacoustic source direction is captured.

In a possible case, the plurality of cameras are installed on thevehicle. The in-vehicle infotainment may select, based on the acousticsource direction and from the plurality of cameras installed on thevehicle, the camera whose angle of view can ensure that the acousticsource direction is captured, determine the selected camera as thetarget camera, and indicate, based on the acoustic source direction, thetarget camera to collect the image data.

Optionally, after determining the target camera, the in-vehicleinfotainment may further control the target camera to rotate to adjustan angle, so that a middle position of the angle of view of the targetcamera overlaps the acoustic source direction, and the target cameracaptures a complete figure image as far as possible.

In some other possible cases, only one camera may be installed on thevehicle, and the camera may be installed at a middle position outside aroof of the vehicle. In this way, the camera is the target camera. Afterdetermining the acoustic source direction, the in-vehicle infotainmentmay control the target camera to rotate to adjust an angle, so that theangle of view of the target camera can ensure that the acoustic sourcedirection is captured. For example, the middle position of the angle ofview of the target camera may be enabled to overlap the acoustic sourcedirection, and then the in-vehicle infotainment may indicate the targetcamera to collect the image data.

After the target camera is turned on to collect the image data, thetarget camera may send the collected image data to the in-vehicleinfotainment, and the in-vehicle infotainment may process the receivedimage data, to further determine whether to unlock the vehicle.

Optionally, the unlocking the vehicle based on the image data collectedby the target camera includes: matching the image data collected by thetarget camera with stored figure image data; and unlocking the vehicleif the image data collected by the target camera matches the storedfigure image data.

In this application, image processing may be facial recognition, gaitrecognition, iris recognition, or the like, and correspondingly, thefigure image data stored in the in-vehicle infotainment may include afacial feature, a gait feature, an iris feature, or the like. If imagedata matching is performed based on facial recognition or an irisfeature, the image data collected by the target camera may be an imageor a video. If image data matching is performed based on a gait feature,the image data collected by the target camera may be a video. In otherwords, in this embodiment of this application, there may be a pluralityof implementations of performing, by the in-vehicle infotainment, theimage data matching. The following describes two implementations.

In a first implementation, the figure image data stored in thein-vehicle infotainment includes a facial feature, and the in-vehicleinfotainment performs the image data matching based on facialrecognition.

Optionally, the stored figure image data includes a facial feature; thematching the image data collected by the target camera with storedfigure image data includes: extracting a facial feature of a figurerecognized in the image data; and the unlocking the vehicle if the imagedata collected by the target camera matches the stored figure image dataincludes: unlocking the vehicle if the facial feature of the figurerecognized in the image data matches any stored facial feature.

If image processing is performed on each region of an entire image, arelatively large amount of data needs to be processed. The acousticsource direction has been determined above, and a time interval fromwhen the acoustic source direction is determined to when the image datais collected is very short. Based on this, after obtaining the imagedata collected by the target camera, the in-vehicle infotainment mayalternatively extract a facial feature of a figure that is recognized inthe image data and that is located in the acoustic source direction, andunlock the vehicle if the extracted facial feature matches any storedfacial feature. To be specific, the in-vehicle infotainment may performthe facial recognition in a small range based on the acoustic sourcedirection, to greatly reduce a computing amount of image processing andfurther increase an unlocking speed.

Optionally, the stored figure image data includes a facial feature; thematching the image data collected by the target camera with storedfigure image data includes: extracting a facial feature of a figure thatis recognized in the image data and that is located in the acousticsource direction; and the unlocking the vehicle if the image datacollected by the target camera matches the stored figure image dataincludes: unlocking the vehicle if the extracted facial feature matchesany stored facial feature.

Optionally, the extracting a facial feature of a figure that isrecognized in the image data and that is located in the acoustic sourcedirection includes: determining an image region from the image databased on the acoustic source direction; and extracting a facial featureof a figure recognized in the image region.

Optionally, the determining an image region from the image data based onthe acoustic source direction includes: determining a relative positionof the acoustic source direction in the angle of view of the targetcamera; determining a reference position on the image data based on therelative position; and extending the reference position on the imagedata to each of a left side and a right side by a reference neighborhoodrange, to obtain the image region.

In a second implementation, the figure image data stored in thein-vehicle infotainment includes a gait feature, and the in-vehicleinfotainment performs the image data matching based on gait recognition.

It can be learned from the foregoing that the image data collected bythe target camera may be a video. In this implementation, afterreceiving the image data collected by the target camera, the in-vehicleinfotainment may extract a gait feature of a figure recognized in thevideo, and unlock the vehicle if the gait feature of the figurerecognized in the image data matches any stored gait feature. To bespecific, the image data collected by the target camera is a video, andthe stored figure image data includes a gait feature; the matching theimage data collected by the target camera with stored figure image dataincludes: extracting a gait feature of a figure recognized in the video;and the unlocking the vehicle if the image data collected by the targetcamera matches the stored figure image data includes: unlocking thevehicle if the gait feature of the figure recognized in the image datamatches any stored gait feature.

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature; the matchingthe image data collected by the target camera with stored figure imagedata includes: extracting a facial feature of a figure recognized in theimage data; determining, as a target figure, a recognized figure whosefacial feature matches any stored facial feature; extracting a gaitfeature of the target figure recognized in the image data; and unlockingthe vehicle if the gait feature of the target figure matches any storedgait feature.

In this application, if gait recognition is performed on all image dataof the collected video, a relatively large amount of data needs to beprocessed. Therefore, the in-vehicle infotainment may perform facialrecognition on the collected video, to obtain a figure matching anystored facial feature, further track the figure, and perform gaitrecognition only on the figure in the video. A process of performing thefacial recognition on the collected image data may be: processing imagedata of an entire image, or processing image data in a small range basedon the acoustic source direction. For a related implementation, refer tothe foregoing related descriptions, and details are not described hereinagain.

Optionally, after the matching the image data collected by the targetcamera with stored figure image data, the method further includes: ifthe image data collected by the target camera does not match the storedfigure image data, indicating the target camera to be turned off.

In this application, if the image data, for example, an image or avideo, collected by the target camera does not match the stored figureimage data, for example, facial recognition fails, gait recognitionfails, iris recognition fails, no face is recognized, no gait isrecognized, or no iris is recognized, the in-vehicle infotainment maycontrol the camera to be turned off, to reduce power consumption.

According to a second aspect, a vehicle unlocking apparatus is provided.The vehicle unlocking apparatus has a function of implementing behaviorin the vehicle unlocking method in the first aspect. The vehicleunlocking apparatus includes one or more modules, and the one or moremodules are configured to implement the vehicle unlocking methodprovided in the first aspect.

In other words, this application provides a vehicle unlocking apparatus,applied to a vehicle, where the apparatus includes:

an obtaining module, configured to obtain a voice signal;

a determining module, configured to determine an acoustic sourcedirection based on the voice signal if the voice signal includes awakeup keyword, where the wakeup keyword is used to indicate that thevehicle currently needs to be unlocked;

an indication module, configured to indicate, based on the acousticsource direction, a target camera to collect image data, where thetarget camera is a camera that is installed on the vehicle and whoseangle of view can ensure that the acoustic source direction is captured;and

an unlocking module, configured to unlock the vehicle based on the imagedata collected by the target camera.

Optionally, the indication module includes:

a first determining unit, configured to select, based on the acousticsource direction and from a plurality of cameras installed on thevehicle, a camera whose angle of view can ensure that the acousticsource direction is captured, and determine the selected camera as thetarget camera; and

a first indication unit, configured to indicate, based on the acousticsource direction, the target camera to collect the image data.

Optionally, the unlocking module includes:

a matching unit, configured to match the image data collected by thetarget camera with stored figure image data; and

an unlocking unit, configured to unlock the vehicle if the image datacollected by the target camera matches the stored figure image data.

Optionally, the stored figure image data includes a facial feature;

the matching unit is specifically configured to:

a first extraction subunit, configured to extract a facial feature of afigure recognized in the image data; and the unlocking unit includes:

a first unlocking subunit, configured to unlock the vehicle if thefacial feature of the figure recognized in the image data matches anystored facial feature.

Optionally, the stored figure image data includes a facial feature;

the first extraction subunit is further configured to:

extract a facial feature of a figure that is recognized in the imagedata and that is located in the acoustic source direction; and the firstunlocking subunit is further configured to:

unlock the vehicle if the extracted facial feature matches any storedfacial feature.

Optionally, the first extraction subunit is specifically configured to:

determine an image region from the image data based on the acousticsource direction; and

extract a facial feature of a figure recognized in the image region.

Optionally, the first extraction subunit is further specificallyconfigured to:

determine a relative position of the acoustic source direction in theangle of view of the target camera;

determine a reference position on the image data based on the relativeposition; and

extend the reference position on the image data to each of a left sideand a right side by a reference neighborhood range, to obtain the imageregion.

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature;

the matching unit includes:

a second extraction subunit, configured to extract a gait feature of afigure recognized in the video; and the unlocking unit includes:

a second unlocking subunit, configured to unlock the vehicle if the gaitfeature of the figure recognized in the image data matches any storedgait feature.

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature;

the second extraction subunit is specifically configured to:

extract a facial feature of a figure recognized in the image data;

determine, as a target figure, a recognized figure whose facial featurematches any stored facial feature; and

extract a gait feature of the target figure recognized in the imagedata; and

the second unlocking subunit is specifically configured to:

unlock the vehicle if the gait feature of the target figure matches anystored gait feature.

Optionally, the determining module includes:

an extraction unit, configured to: if the voice signal includes thewakeup keyword, extract a voiceprint feature in the voice signal; and

a second determining unit, configured to: if the extracted voiceprintfeature matches any stored voiceprint feature, determine the acousticsource direction based on the voice signal.

Optionally, the unlocking module further includes:

a second indication unit, configured to indicate the target camera to beturned off if the image data collected by the target camera does notmatch the stored figure image data.

According to a third aspect, a vehicle unlocking system is provided,where the system includes a vehicle unlocking apparatus and at least onecamera. The vehicle unlocking apparatus is configured to obtain a voicesignal, and determine an acoustic source direction based on the voicesignal if the voice signal includes a wakeup keyword, where the wakeupkeyword is used to indicate that a vehicle currently needs to beunlocked. The vehicle unlocking apparatus is further configured to sendan instruction to a target camera based on the acoustic sourcedirection, and unlock the vehicle based on received image data collectedby the target camera. The target camera is configured to collect theimage data according to the instruction sent by the vehicle unlockingapparatus, where the target camera is a camera that is installed on thevehicle and whose angle of view can ensure that the acoustic sourcedirection is captured.

Optionally, the vehicle unlocking apparatus is further configured toselect, based on the acoustic source direction and from a plurality ofcameras installed on the vehicle, a camera whose angle of view canensure that the acoustic source direction is captured, and determine theselected camera as the target camera.

Optionally, the step of unlocking, by the vehicle unlocking apparatus,the vehicle based on the received image data collected by the targetcamera is specifically:

matching the image data collected by the target camera with storedfigure image data; and

unlocking the vehicle if the image data collected by the target cameramatches the stored figure image data.

Optionally, the stored figure image data includes a facial feature;

the step of matching, by the vehicle unlocking apparatus, the image datacollected by the target camera with the stored figure image data isspecifically:

extracting a facial feature of a figure recognized in the image data;and

the step of unlocking, by the vehicle unlocking apparatus, the vehicleif the image data collected by the target camera matches the storedfigure image data is specifically:

unlocking the vehicle if the facial feature of the figure recognized inthe image data matches any stored facial feature.

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature;

the step of matching, by the vehicle unlocking apparatus, the image datacollected by the target camera with the stored figure image data isspecifically:

extracting a gait feature of a figure recognized in the video; and

the step of unlocking, by the vehicle unlocking apparatus, the vehicleif the image data collected by the target camera matches the storedfigure image data is specifically:

unlocking the vehicle if the gait feature of the figure recognized inthe image data matches any stored gait feature.

According to a fourth aspect, a vehicle unlocking apparatus is provided.The apparatus includes a memory and a processor. The memory isconfigured to store a wakeup keyword, where the wakeup keyword is usedto indicate that a vehicle currently needs to be unlocked. The processoris configured to obtain a voice signal, determine an acoustic sourcedirection based on the voice signal if the voice signal includes thewakeup keyword, and indicate a target camera to collect image data,where the target camera is a camera that is installed on the vehicle andwhose angle of view can ensure that the acoustic source direction iscaptured. The processor is further configured to unlock the vehiclebased on the image data.

Optionally, the processor is further configured to select, based on theacoustic source direction and from a plurality of cameras installed onthe vehicle, a camera whose angle of view can ensure that the acousticsource direction is captured, and determine the selected camera as thetarget camera.

Optionally, the step of unlocking, by the processor, the vehicle basedon the received image data collected by the target camera isspecifically:

matching the image data collected by the target camera with storedfigure image data; and

unlocking the vehicle if the image data collected by the target cameramatches the stored figure image data.

According to a fifth aspect, a vehicle is provided, where the vehicleincludes a memory, a processor, and at least one camera. The memory isconfigured to store a wakeup keyword, where the wakeup keyword is usedto indicate that a vehicle currently needs to be unlocked. The processoris configured to obtain a voice signal, and determine an acoustic sourcedirection based on the voice signal if the voice signal includes thewakeup keyword. The processor is further configured to select a targetcamera from the at least one camera based on the acoustic sourcedirection, send an instruction to the target camera, and unlock thevehicle based on received image data collected by the target camera. Thetarget camera is configured to collect the image data according to theinstruction sent by the processor, where the target camera is a camerathat is installed on the vehicle and whose angle of view can ensure thatthe acoustic source direction is captured.

Optionally, the processor is further configured to select, based on theacoustic source direction and from a plurality of cameras installed onthe vehicle, a camera whose angle of view can ensure that the acousticsource direction is captured, and determine the selected camera as thetarget camera.

Optionally, the step of unlocking, by the processor, the vehicle basedon the received image data collected by the target camera isspecifically:

matching the image data collected by the target camera with storedfigure image data; and

unlocking the vehicle if the image data collected by the target cameramatches the stored figure image data.

According to a sixth aspect, a computer readable storage medium isprovided. The computer readable storage medium stores an instruction,and when the instruction is run on a computer, the computer is enabledto perform the vehicle unlocking method in the first aspect.

According to a seventh aspect, a computer program product including aninstruction is provided. When the instruction is run on a computer, thecomputer is enabled to perform the vehicle unlocking method in the firstaspect.

Technical effects obtained in the second aspect, the third aspect, thefourth aspect, the fifth aspect, the sixth aspect, and the seventhaspect are similar to the technical effects obtained by using thetechnical means corresponding to the first aspect, and details are notdescribed herein again.

The technical solutions provided in this application include at leastthe following beneficial effects:

In this application, first, the wakeup keyword detection is performed onthe obtained voice signal. If the voice signal includes the wakeupkeyword, in other words, the wakeup keyword detection succeeds, theacoustic source direction is determined, and the target camera isindicated to collect the image data. The target camera is a camera thatis installed on the vehicle and whose angle of view can ensure that theacoustic source direction is captured. Then, the vehicle is unlockedbased on the image data collected by the target camera. Compared with amethod in which a user manually enters a password in a relatedtechnology, a process of performing the wakeup keyword detection on thevoice signal and unlocking the vehicle based on the image data in thissolution takes a relatively short time, in other words, a vehicleunlocking speed in this solution is relatively high. In addition, inthis solution, the vehicle is unlocked based on the wakeup keyword andthe image data, and the vehicle cannot be unlocked if there is no wakeupkeyword or the image data does not match the stored figure image data.Therefore, security of this solution is relatively high.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a schematic structural diagram of a vehicle according to anexample embodiment of this application;

FIG. 2 is a schematic structural diagram of a microphone according to anexample embodiment of this application;

FIG. 3 is a schematic structural diagram of a vehicle according toanother example embodiment of this application;

FIG. 4 is a flowchart of a vehicle unlocking method according to anexample embodiment of this application;

FIG. 5 is a schematic diagram of a method for determining an imageregion according to an example embodiment of this application;

FIG. 6 is a flowchart of a vehicle unlocking method according to anotherexample embodiment of this application; and

FIG. 7 is a schematic structural diagram of a vehicle unlockingapparatus according to an example embodiment of this application.

DESCRIPTION OF EMBODIMENTS

To make the objectives, technical solutions, and advantages of thisapplication clearer, the following further describes the implementationsof this application in detail with reference to the accompanyingdrawings.

FIG. 1 is a schematic structural diagram of vehicle 100 according to anembodiment of this application. Referring to FIG. 1 , a microphone 101and a plurality of cameras 102 are installed on the vehicle 100, and thevehicle further includes an in-vehicle infotainment 103 (not shown),also be referred to as a vehicle unlocking apparatus. The microphone 101may be in a communication connection to the in-vehicle infotainment 103,and each camera 102 may be in a communication connection to thein-vehicle infotainment 103. In some embodiments, the microphone 101 maybe further in a communication connection to each camera 102.

The microphone 101 may be installed at a middle position outside a roofof the vehicle 100, for example, a position 11 in FIG. 1 , or may beinstalled at another position outside a vehicle body. The cameras 102may be installed at different positions outside the vehicle body, eachcamera 102 corresponds to one angle of view, and a sum of angles of viewof the cameras 102 is not less than 360 degrees. Referring to FIG. 1 ,it is assumed that four cameras 102 are installed. The four cameras 102may be respectively located at a position 21, a position 22, a position23, and a position 24 in the figure. An angle of view of each camera maybe shown by dashed lines in FIG. 1 . The angles of view of the cameras102 may be the same or different, and each angle of view may be 120degrees, 160 degrees, or the like. Each camera 102 may be a wide anglecamera, an infrared camera, a binocular camera, or the like. A camera ofa corresponding type may be installed on the vehicle according to anactual application.

The microphone 101 is a low power consumption apparatus, may be in aworking state in real time, and is configured to collect a voice signalin an environment, and send the collected voice signal to the in-vehicleinfotainment 103 for voice processing, including wakeup keyworddetection, acoustic source localization, and the like. In some otherembodiments, the voice processing may further include voiceprintmatching.

The microphone 101 may be a microphone array including a plurality ofmicrophones, for example, a microphone array including four, six, oreight microphones, and each microphone may be an omnidirectionalmicrophone. For example, referring to FIG. 2 , the microphone 101 is aplanar microphone array including eight microphones, and the eightmicrophones may be evenly distributed on a 360-degree ring on a plane.

The in-vehicle infotainment 103 is configured to receive the voicesignal sent by the microphone 101, and after detecting a wakeup keywordand determining an acoustic source direction, control the target camera102 to be turned on, in other words, wake up the target camera 102. Thetarget camera 102 is a camera that is installed on the vehicle 100 andwhose angle of view can ensure that the acoustic source direction iscaptured.

The camera 102 is a high power consumption apparatus. When the targetcamera 102 is not woken up, each camera 102 is in an off state. Afterthe target camera 102 is woken up, the target camera 102 is in a workingstate, and the other cameras 102 are still in the off state. The targetcamera 102 may collect image data in an angle of view of the targetcamera 102, and send the collected image data to the in-vehicleinfotainment 103, and then the target camera 102 may be turned offaccording to control of the in-vehicle infotainment 103.

The in-vehicle infotainment 103 is further configured to receive theimage data collected by the target camera 102, and control, based on thecollected image data, the vehicle 100 to be unlocked.

In some other embodiments, only one camera 102 may be installed on thevehicle 100, and the camera 102 may be installed at a middle positionoutside the roof of the vehicle. In this way, the camera 102 may be usedas the target camera 102. After determining the acoustic sourcedirection, the in-vehicle infotainment 103 may control the target camera102 to rotate to adjust an angle, so that an angle of view of the targetcamera 102 can ensure that the acoustic source direction is captured.

In some embodiments, the in-vehicle infotainment 103 in FIG. 1 mayinclude a processor 302, a controller 303, and a memory 304 in FIG. 3 .

In other words, referring to FIG. 3 , a vehicle 300 may include a sensormodule 301, the processor 302, the controller 303, and the memory 304.The sensor module 301 may include a microphone 3011 and a plurality ofcameras 3012, and the processor 302 may include a voice processing unit3021 and an image processing unit 3022, and the vehicle body controller303 may include a camera control unit 3031 and a vehicle lock controlunit 3032.

The microphone 3011 and the plurality of cameras 3012 that are includedin the sensor module 301 are respectively configured to collect a voicesignal and image data. The microphone 3011 may be a microphone arrayincluding a plurality of microphones, and each camera 3012 may be asingle camera, a binocular camera, an infrared camera, a wide anglecamera, or the like.

The voice processing unit 3021 and the image processing unit 3022 thatare included in the processor 302 are respectively configured to processthe collected voice signal and image data. The processor 302 may be ageneral purpose central processing unit (CPU), a network processor (NP),a microprocessor, or one or more integrated circuits configured toimplement the solutions of this application, for example, anapplication-specific integrated circuit (ASIC), a programmable logicdevice (PLD), or a combination thereof. The PLD may be a complexprogrammable logic device (CPLD), a field-programmable gate array(FPGA), a generic array logic (GAL), or any combination thereof.

The camera control unit 3031 and the vehicle lock control unit 3032 thatare included in the controller 303 are respectively configured tocontrol each camera 3012 to be turned on or turned off and control thevehicle to be unlocked.

Optionally, the controller 303 may be included in the in-vehicleinfotainment 103. In other words, the in-vehicle infotainment 103 mayinclude the controller 303, or the in-vehicle infotainment 103 does notinclude the controller 303. If the in-vehicle infotainment 103 includesthe controller 303, after processing the voice signal or the image data,the processor 302 may send indication information to the controller 303,and the controller 303 controls, according to the indicationinformation, the camera 3012 to be turned on or off, or controls thevehicle to be unlocked. If the in-vehicle infotainment 103 does notinclude the controller 303, after processing the voice signal or theimage data, the processor 302 may indicate the camera 3012 to be turnedon or off or control the vehicle to be unlocked.

The memory 304 is configured to store data, program code, and the likein the embodiments of this application. The memory 304 may be aread-only memory (ROM), or may be a random access memory (RAM), or maybe an electrically erasable programmable read-only memory (EEPROM), oran optical disc (including a compact disc read-only memory (CD-ROM), acompact disc, a laser disc, a digital versatile disc, a Blu-ray disc, orthe like), a magnetic storage medium or another magnetic storage device,or any other medium that can carry or store expected program code in aform of an instruction or a data structure and that can be accessed by acomputer, but is not limited thereto.

The vehicle 300 further includes a communications bus and acommunications interface, and the communications bus is configured totransmit information between the foregoing components. Thecommunications bus may be classified into an address bus, a data bus, acontrol bus, and the like. The communications interface uses anyapparatus such as a transceiver, and is configured to communicate withanother device or a communications network. The communications interfaceincludes a wired communications interface, and may further include awireless communications interface. The wired communications interface,for example, may be an Ethernet interface. The Ethernet interface may bean optical interface, an electrical interface, or a combination thereof.The wireless communications interface may be a wireless local areanetwork (wireless local area networks, WLANs) interface, a cellularnetwork communications interface, or a combination thereof. The memory304 may exist independently, and is connected to the processor 302 byusing the communications bus, or the memory 304 may be integrated withthe processor 302.

In some embodiments, the vehicle 300 may include a plurality ofprocessors, and each of these processors may be a single-core processor,or may be a multi-core processor. The processor may be one or moredevices, circuits, and/or processing cores used to process data (forexample, a computer program instruction).

In some embodiments, the memory 304 is configured to store program codefor performing the solutions in this application, and the processor 302may execute the program code stored in the memory 203. The program codemay include one or more software modules. The vehicle may implement, byusing the processor 302, the controller 303, and the program code in thememory 304, a vehicle unlocking method provided in the followingembodiment in FIG. 4 .

FIG. 4 is a flowchart of a vehicle unlocking method according to anembodiment of this application. An example in which the method isapplied to the in-vehicle infotainment 103 in FIG. 1 is used fordescription. Referring to FIG. 4 , the method includes the followingsteps.

Step 401: Obtain a voice signal.

In this embodiment of this application, a microphone is installed on avehicle, and the microphone may collect a voice signal in anenvironment, and transmit the collected voice signal to the in-vehicleinfotainment.

It should be noted that the microphone is a low power consumptionapparatus.

Therefore, the microphone may be in a working state at all times, tocollect the voice signal in the environment in real time.

Step 402: Determine an acoustic source direction based on the voicesignal if the voice signal includes a wakeup keyword.

In this embodiment of this application, after the voice signal isobtained, wakeup keyword detection may be performed on the voice signal,where the wakeup keyword is used to indicate that the vehicle currentlyneeds to be unlocked. If it is detected that the voice signal includesthe wakeup keyword, it may be determined that the wakeup keyworddetection succeeds.

The wakeup keyword is set based on a user voice. After obtaining a uservoice used to set the wakeup keyword, the in-vehicle infotainmentprocesses the user voice to obtain a spectrogram or text information ofthe user voice, and stores the spectrogram or the text information forthe wakeup keyword detection. After receiving the voice signal in theenvironment, the microphone may send the voice signal to the in-vehicleinfotainment. The in-vehicle infotainment may process the voice signalto obtain a spectrogram or text information of the voice signal. If thespectrogram of the voice signal matches the spectrogram of the wakeupkeyword, it may be determined that the voice signal includes the wakeupkeyword, or if the text information of the voice signal includes thewakeup keyword, it may be determined that the voice signal includes thewakeup keyword.

For example, it is assumed that the wakeup keyword is set based on auser voice “Xiaoyi Xiaoyi” made by a user, the wakeup keyword is “XiaoyiXiaoyi”, and the in-vehicle infotainment stores a spectrogram of thewakeup keyword. If the user says “Xiaoyi Xiaoyi” outside the vehicle,and a spectrogram that is of the voice signal and that is obtained afterthe in-vehicle infotainment processes the obtained voice signal matchesthe spectrogram of the wakeup keyword, it may be determined that thevoice signal includes the wakeup keyword.

It can be learned from the foregoing that the microphone may be amicrophone array including a plurality of microphones, to collect aplurality of voice signals. After collecting the plurality of voicesignals, the microphone may send the collected plurality of voicesignals to the in-vehicle infotainment. The in-vehicle infotainment mayprocess the received plurality of voice signals according to abeamforming algorithm, to form a beam signal having a highestsignal-to-noise ratio, or obtain a plurality of beam signals whosesignal-to-noise ratios are greater than a signal-to-noise ratiothreshold.

For example, it is assumed that currently, only one person speaks loudlyoutside the vehicle. Then, after the microphone collects a plurality ofvoice signals and sends the plurality of voice signals to the in-vehicleinfotainment, the in-vehicle infotainment may process the plurality ofvoice signals according to the beamforming algorithm, to obtain a beamsignal having a highest signal-to-noise ratio. It is assumed thatcurrently, a plurality of people speak loudly outside the vehicle. Then,more than one beam signal whose signal-to-noise ratios are greater thanthe signal-to-noise ratio threshold may be finally determined.

After obtaining the one or more beam signals according to thebeamforming algorithm, the in-vehicle infotainment may process the oneor more beam signals to perform the wakeup keyword detection. Forexample, the in-vehicle infotainment may process the one or more beamsignals to obtain a spectrogram of each beam signal. If there is a beamsignal whose spectrogram matches the spectrogram of the wakeup keyword,it may be determined that the obtained voice signal includes the wakeupkeyword. Alternatively, the in-vehicle infotainment may perform speechrecognition on the one or more beam signals to obtain text informationof each beam signal. If there is a beam signal whose text informationincludes the wakeup keyword, it may be determined that the obtainedvoice signal includes the wakeup keyword.

After determining that the obtained voice signal includes the wakeupkeyword, the in-vehicle infotainment may further perform acoustic sourcelocalization according to the beamforming algorithm, to determine theacoustic source direction, that is, determine an acoustic sourcedirection corresponding to the beam signal including the wakeup keyword.A principle of performing the acoustic source localization according tothe beamforming algorithm is: For a same acoustic source, phases of theplurality of voice signals collected by the plurality of microphones maybe different. Delay estimation may be performed based on the phasescorresponding to the plurality of voice signals, and then, the acousticsource localization is performed based on a delay error, to determinethe acoustic source direction.

For example, the microphone in FIG. 2 is still used as an example. Themicrophone is a planar microphone array including eight microphones. Theeight microphones may be evenly distributed on a 360-degree ringhorizontally. Each microphone collects one voice signal, and the eightmicrophones may collect eight voice signals. The microphone may send thecollected eight voice signals to the in-vehicle infotainment. Thein-vehicle infotainment may obtain one or more beam signals according tothe beamforming algorithm, and perform the wakeup keyword detection oneach beam signal. After determining that there is a beam signalincluding the wakeup keyword, the in-vehicle infotainment may furtherperform the acoustic source localization according to the beamformingalgorithm, to determine an acoustic source direction of the beam signalincluding the wakeup keyword.

Optionally, in this embodiment of this application, to further improvesecurity, after the wakeup keyword detection is performed on theobtained voice signal, voiceprint feature matching may be furtherperformed, and after the voiceprint feature matching succeeds, theacoustic source direction is determined. To be specific, if the voicesignal includes the wakeup keyword, a voiceprint feature in the voicesignal is extracted, and if the extracted voiceprint feature matches anystored voiceprint feature, the acoustic source direction is determinedbased on the voice signal.

In this embodiment of this application, the in-vehicle infotainmentstores a voiceprint feature of a figure. After determining that theobtained voice signal includes the wakeup keyword, the in-vehicleinfotainment may extract the voiceprint feature in the voice signal, andmay further determine the acoustic source direction based on the voicesignal if the extracted voiceprint feature matches any stored voiceprintfeature.

For example, after obtaining the one or more beam signals according tothe beamforming algorithm, and determining the beam signal including thewakeup keyword, the in-vehicle infotainment may extract a voiceprintfeature in the beam signal, and if the extracted voiceprint featurematches any stored voiceprint feature, the in-vehicle infotainment mayfurther determine, according to the beamforming algorithm, the acousticsource direction of the beam signal including the wakeup keyword.

It should be noted that in this embodiment of this application, thein-vehicle infotainment may store voiceprint features of one or morefigures. If the stored voiceprint feature includes a voiceprint featurethat matches the extracted voiceprint feature of the voice signal, thein-vehicle infotainment may determine that the voiceprint featurematching succeeds. In addition, the in-vehicle infotainment may furtherstore a spectrogram or text information of one or more wakeup keywords.If the obtained voice signal includes any one of the one or more wakeupkeywords, the in-vehicle infotainment may determine that the wakeupkeyword detection succeeds.

In some other embodiments, after obtaining the voice signal, thein-vehicle infotainment may first perform the wakeup keyword detection,then determine the acoustic source direction, and then perform thevoiceprint feature matching.

Step 403: Indicate, based on the acoustic source direction, a targetcamera to collect image data.

In this embodiment of this application, after determining the acousticsource direction, the in-vehicle infotainment may control the targetcamera to be turned on to collect the image data. The target camera is acamera that is installed on the vehicle and whose angle of view canensure that the acoustic source direction is captured.

In a possible case, a plurality of cameras are installed on the vehicle.The in-vehicle infotainment may select, based on the acoustic sourcedirection and from the plurality of cameras installed on the vehicle, acamera whose angle of view can ensure that the acoustic source directionis captured, determine the selected camera as the target camera, andindicate, based on the acoustic source direction, the target camera tocollect the image data.

In this embodiment of this application, each of the plurality of camerasinstalled on the vehicle may correspond to one angle of view, each angleof view is a part of 360 degrees on a plane, and the acoustic sourcedirection may be an angle value. For example, it is assumed that theacoustic source direction is 50 degrees, and an angle of view of one ofthe plurality of cameras is 0 degrees to 160 degrees. Then, the cameramay be determined as the target camera, and the target camera is turnedon to collect image data.

Optionally, after determining the target camera, the in-vehicleinfotainment may further control the target camera to rotate to adjustan angle, so that a middle position of the angle of view of the targetcamera overlaps the acoustic source direction, and the target cameracaptures a complete figure image as far as possible.

It should be noted that in this embodiment of this application, theremay be an overlapping part between the plurality of angles of view ofthe plurality of cameras. If the acoustic source direction is located onan overlapping part between angles of view of two cameras, thein-vehicle infotainment may control both of the two cameras to be turnedon as target cameras.

In some other possible cases, only one camera may be installed on thevehicle, and the camera may be installed at a middle position outside aroof of the vehicle. In this way, the camera is the target camera. Afterdetermining the acoustic source direction, the in-vehicle infotainmentmay control the target camera to rotate to adjust an angle, so that theangle of view of the target camera can ensure that the acoustic sourcedirection is captured. For example, the middle position of the angle ofview of the target camera may be enabled to overlap the acoustic sourcedirection, and then the in-vehicle infotainment may indicate the targetcamera to collect the image data.

After the target camera is turned on to collect the image data, thetarget camera may send the collected image data to the in-vehicleinfotainment, and the in-vehicle infotainment may process the receivedimage data, to determine whether to unlock the vehicle.

Step 404: Unlock the vehicle based on the image data collected by thetarget camera.

In this embodiment of this application, the in-vehicle infotainment maystore figure image data, and the in-vehicle infotainment may performimage processing on the image data collected by the target camera, tounlock the vehicle based on an image processing result.

Optionally, a process of performing, by the in-vehicle infotainment, theimage processing may be: matching the image data collected by the targetcamera with the stored figure image data, and then unlocking the vehicleif the image data collected by the target camera matches the storedfigure image data.

In this embodiment of this application, the image processing may befacial recognition, gait recognition, iris recognition, or the like, andcorrespondingly, the figure image data stored in the in-vehicleinfotainment may include a facial feature, a gait feature, an irisfeature, or the like. If image data matching is performed based onfacial recognition or an iris feature, the image data collected by thetarget camera may be an image or a video. If image data matching isperformed based on a gait feature, the image data collected by thetarget camera may be a video. In other words, in this embodiment of thisapplication, there may be a plurality of implementations of performing,by the in-vehicle infotainment, the image data matching. The followingdescribes two implementations.

In a first implementation, the figure image data stored in thein-vehicle infotainment includes a facial feature, and the in-vehicleinfotainment performs the image data matching based on facialrecognition.

It can be learned from the foregoing that the image data collected bythe target camera may be an image or a video. In this implementation,after collecting the image data by using the target camera, thein-vehicle infotainment may extract a facial feature of a figurerecognized in the image data, and unlock the vehicle if the facialfeature of the figure recognized in the image data matches any storedfacial feature.

It should be noted that if the image data collected by the target camerais an image, there may be one or more images, or if the image datacollected by the target camera is a video, the video may include aplurality of frames of images.

In this embodiment of this application, the in-vehicle infotainmentstores the facial feature. After obtaining the image data collected bythe target camera, the in-vehicle infotainment may recognize the face ofthe figure in the image data by using a facial recognition technology,and after recognizing the face, extract the facial feature of the figurerecognized in the image data. If the extracted facial feature of anyfigure matches any stored facial feature, it may be determined that thefacial recognition succeeds, and the vehicle is unlocked.

For example, the facial feature may be a geometric feature, for example,a five-sense feature or a five-sense feature and a distance featurebetween the five senses. One or more figures may be recognized, and afacial feature of each recognized figure is extracted. If a similaritybetween an extracted facial feature of a figure and any stored facialfeature exceeds a face similarity threshold, the in-vehicle infotainmentmay determine that facial feature matching succeeds, and further controlthe vehicle to be unlocked.

If image processing is performed on each region of an entire image, arelatively large amount of data needs to be processed. The acousticsource direction has been determined above, and a time interval fromwhen the acoustic source direction is determined to when the image datais collected is very short. Based on this, after obtaining the imagedata collected by the target camera, the in-vehicle infotainment mayalternatively extract a facial feature of a figure that is recognized inthe image data and that is located in the acoustic source direction, andunlock the vehicle if the extracted facial feature matches any storedfacial feature. To be specific, the in-vehicle infotainment may performthe facial recognition in a small range based on the acoustic sourcedirection, to greatly reduce a computing amount of image processing andfurther increase an unlocking speed.

In this embodiment of this application, on the basis that the acousticsource direction is determined, the in-vehicle infotainment maydetermine an image region from the collected image data based on theacoustic source direction, and extract a facial feature of a figurerecognized in the image region.

An implementation of determining the image region from the collectedimage data based on the acoustic source direction may be: determining arelative position of the acoustic source direction in the angle of viewof the target camera; determining a reference position on the image databased on the relative position; and extending the reference position onthe image data to each of a left side and a right side by a referenceneighborhood range, to obtain the image region.

For example, referring to FIG. 5 , it is assumed that the angle of viewof the target camera is a sector range of 0 degrees to 160 degrees inthe figure, the left is 0 degrees, the right is 160 degrees, theacoustic source direction is 80 degrees, a width of an image is 16centimeters, and the reference neighborhood range is 1/16 of the entirewidth of the image, namely, one centimeter. Then, it may be determinedthat the acoustic source direction is located at a position of ½ of theangle of view, that is, the relative position is ½ of the angle of view.Based on the relative position, it may be determined that the referenceposition is a position of ½ of the image, and image data obtained byextending the position of ½ of the image to each of a left side and aright side by 1/16 of the width of the image, namely, by one centimeter,is determined as the image region. In other words, a corresponding imageregion obtained by extending the acoustic source direction, namely, 80degrees, to each of the left side and the right side by 10 degrees isthe image region.

In a second implementation, the figure image data stored in thein-vehicle infotainment includes a gait feature, and the in-vehicleinfotainment performs the image data matching based on gait recognition.

It can be learned from the foregoing that the image data collected bythe target camera may be a video. In this implementation, afterreceiving the image data collected by the target camera, the in-vehicleinfotainment may extract a gait feature of a figure recognized in thevideo, and unlock the vehicle if the gait feature of the figurerecognized in the image data matches any stored gait feature.

In this embodiment of this application, the in-vehicle infotainmentstores a gait feature of a user. After obtaining the video collected bythe target camera, the in-vehicle infotainment may extract a gaitfeature of each figure in the video based on a gait recognitiontechnology, and if the extracted gait feature of any figure matches anystored gait feature, determine that the gait recognition succeeds, andunlock the vehicle.

Optionally, the image data collected by the target camera is a video.After collecting the image data by using the target camera, first, thein-vehicle infotainment may alternatively extract a facial feature of afigure recognized in the video, and determine, as a target figure, arecognized figure whose facial feature matches any stored facialfeature. Then, the in-vehicle infotainment extracts a gait feature ofthe target figure recognized in the video, and unlocks the vehicle ifthe gait feature of the target figure matches any stored gait feature.

In this embodiment of this application, if gait recognition is performedon all image data of the collected video, a relatively large amount ofdata needs to be processed. Therefore, the in-vehicle infotainment mayperform facial recognition on the collected video, to obtain a figurematching any stored facial feature, further track the figure, andperform gait recognition only on the figure in the video. A process ofperforming the facial recognition on the collected image data may be:processing image data of an entire image, or processing image data in asmall range based on the acoustic source direction. For a relatedimplementation, refer to the foregoing related descriptions, and detailsare not described herein again.

It should be noted that if a similarity between an extracted gaitfeature of a figure and any stored gait feature exceeds a gaitsimilarity threshold, the in-vehicle infotainment may determine thatgait feature matching succeeds, and further control the vehicle to beunlocked. Alternatively, if a similarity between the extracted gaitfeature of the target figure and any stored gait feature exceeds a gaitsimilarity threshold, the in-vehicle infotainment may determine thatgait feature matching succeeds.

In some embodiments, the target camera may continuously collect aplurality of frames of images after being turned on. In a collectionprocess, each collected frame of image is sent to the in-vehicleinfotainment in real time. The in-vehicle infotainment may performfacial recognition and facial feature matching on each received frame ofimage in real time, or generate a video stream based on the receivedplurality of frames of images, recognize a figure in the video in realtime, and extract a gait feature of each recognized figure or arecognized target figure in real time. After the facial feature matchingor the gait feature matching succeeds, the in-vehicle infotainment maycontrol the vehicle to be unlocked.

After the image data is collected by using the target camera, if theimage data collected by the target camera does not match the stored userimage data, the in-vehicle infotainment may turn off the target camera,to reduce power consumption, or the in-vehicle infotainment may furthercontrol, based on a voice of a figure, the target camera to remain in anon or off state, or if no figure is recognized in a time period, thein-vehicle infotainment may control the target camera to be turned off.

In this embodiment of this application, if the image data, for example,an image or a video, collected by the target camera does not match thestored figure image data, for example, facial recognition fails, gaitrecognition fails, iris recognition fails, no face is recognized, nogait is recognized, or no iris is recognized, the in-vehicleinfotainment may control the camera to be turned off, to reduce powerconsumption.

FIG. 6 is a flowchart of another vehicle unlocking method according toan embodiment of this application. It is assumed that a plurality ofcameras are installed on a vehicle. Referring to FIG. 6 , a microphonearray installed on the vehicle collects voice signals in real time andsends the voice signal to an in-vehicle infotainment. The in-vehicleinfotainment obtains one or more beam signals according to a beamformingalgorithm, determines a beam signal having a highest signal-to-noiseratio, determines an acoustic source direction of the beam signalaccording to the beamforming algorithm, and performs wakeup keyworddetection on the beam signal. If no wakeup keyword is detected, thein-vehicle infotainment continues to obtain a collected voice signal, orif a wakeup keyword is detected, extracts a voiceprint feature of thebeam signal, and performs voiceprint feature matching. If the voiceprintfeature does not match a stored voiceprint feature, the in-vehicleinfotainment continues to obtain a collected voice signal. If thevoiceprint feature matches the stored voiceprint feature, the in-vehicleinfotainment controls a target camera corresponding to the acousticsource direction to be turned off, to collect image data. The targetcamera sends the collected image data to the in-vehicle infotainment,and the in-vehicle infotainment extracts a facial feature in thecollected image data, and unlocks the vehicle if facial feature matchingsucceeds, or turns off the target camera if facial feature matchingfails.

It should be noted that when the image data does not match stored figureimage data in preset duration, the in-vehicle infotainment may controlthe target camera to be turned off.

To sum up, in this embodiment of this application, first, the wakeupkeyword detection is performed on the obtained voice signal. If thevoice signal includes the wakeup keyword, in other words, the wakeupkeyword detection succeeds, the acoustic source direction is determined,and the target camera is indicated to collect the image data. The targetcamera is a target camera that is installed on the vehicle and whoseangle of view can ensure that the acoustic source direction is captured.Then, the vehicle is unlocked based on the image data collected by thetarget camera. Compared with a method in which a user manually enters apassword in a related technology, a process of performing the wakeupkeyword detection on the voice signal and unlocking the vehicle based onthe image data in this solution takes a relatively short time, in otherwords, a vehicle unlocking speed in this solution is relatively high. Inaddition, in this solution, the vehicle is unlocked based on the wakeupkeyword and the image data, and the vehicle cannot be unlocked if thereis no wakeup keyword or the image data does not match the stored figureimage data. Therefore, security of this solution is relatively high.

FIG. 7 is a schematic structural diagram of a vehicle unlockingapparatus according to an embodiment of this application. The vehicleunlocking apparatus 700 may be implemented as a part or all of a vehicleby using software, hardware, or a combination of software and hardware,and the vehicle may be the vehicle in FIG. 1 . Referring to FIG. 7 , theapparatus 700 includes an obtaining module 701, a determining module702, an indication module 703, and an unlocking module 704.

The obtaining module 701 is configured to obtain a voice signal.

The determining module 702 is configured to determine an acoustic sourcedirection based on the voice signal if the voice signal includes awakeup keyword, where the wakeup keyword is used to indicate that avehicle currently needs to be unlocked.

The indication module 703 is configured to indicate, based on theacoustic source direction, a target camera to collect image data, wherethe target camera is a camera that is installed on the vehicle and whoseangle of view can ensure that the acoustic source direction is captured.

The unlocking module 704 is configured to unlock the vehicle based onthe image data collected by the target camera.

Optionally, the indication module 703 includes:

a first determining unit, configured to select, based on the acousticsource direction and from a plurality of cameras installed on thevehicle, a camera whose angle of view can ensure that the acousticsource direction is captured, and determine the selected camera as thetarget camera; and a first indication unit, configured to indicate,based on the acoustic source direction, the target camera to collect theimage data.

Optionally, the unlocking module 704 includes:

a matching unit, configured to match the image data collected by thetarget camera with stored figure image data; and

an unlocking unit, configured to unlock the vehicle if the image datacollected by the target camera matches the stored figure image data.

Optionally, the stored figure image data includes a facial feature;

the matching unit is specifically configured to:

a first extraction subunit, configured to extract a facial feature of afigure recognized in the image data; and the unlocking unit includes:

a first unlocking subunit, configured to unlock the vehicle if thefacial feature of the figure recognized in the image data matches anystored facial feature.

Optionally, the stored figure image data includes a facial feature;

the first extraction subunit is further configured to:

extract a facial feature of a figure that is recognized in the imagedata and that is located in the acoustic source direction; and

the first unlocking subunit is further configured to:

unlock the vehicle if the extracted facial feature matches any storedfacial feature.

Optionally, the first extraction subunit is specifically configured to:

determine an image region from the image data based on the acousticsource direction; and

extract a facial feature of a figure recognized in the image region.

Optionally, the first extraction subunit is further specificallyconfigured to:

determine a relative position of the acoustic source direction in theangle of view of the target camera;

determine a reference position on the image data based on the relativeposition; and

extend the reference position on the image data to each of a left sideand a right side by a reference neighborhood range, to obtain the imageregion

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature;

the matching unit includes:

a second extraction subunit, configured to extract a gait feature of afigure recognized in the video; and the unlocking unit includes:

a second unlocking subunit, configured to unlock the vehicle if the gaitfeature of the figure recognized in the image data matches any storedgait feature.

Optionally, the image data collected by the target camera is a video,and the stored figure image data includes a gait feature;

the second extraction subunit is specifically configured to:

extract a facial feature of a figure recognized in the image data;

determine, as a target figure, a recognized figure whose facial featurematches any stored facial feature; and

extract a gait feature of the target figure recognized in the imagedata; and

the second unlocking subunit is specifically configured to:

unlock the vehicle if the gait feature of the target figure matches anystored gait feature.

Optionally, the determining module includes:

an extraction unit, configured to: if the voice signal includes thewakeup keyword, extract a voiceprint feature in the voice signal; and

a second determining unit, configured to: if the extracted voiceprintfeature matches any stored voiceprint feature, determine the acousticsource direction based on the voice signal.

Optionally, the unlocking module 704 further includes:

a second indication unit, configured to indicate the target camera to beturned off if the image data collected by the target camera does notmatch the stored figure image data.

In this embodiment of this application, first, wakeup keyword detectionis performed on the obtained voice signal. If the voice signal includesthe wakeup keyword, in other words, the wakeup keyword detectionsucceeds, the acoustic source direction is determined, and the targetcamera is indicated to collect the image data. The target camera is atarget camera that is installed on the vehicle and whose angle of viewcan ensure that the acoustic source direction is captured. Then, thevehicle is unlocked based on the image data collected by the targetcamera. Compared with a method in which a user manually enters apassword in a related technology, a process of performing the wakeupkeyword detection on the voice signal and unlocking the vehicle based onthe image data in this solution takes a relatively short time, in otherwords, a vehicle unlocking speed in this solution is relatively high. Inaddition, in this solution, the vehicle is unlocked based on the wakeupkeyword and the image data, and the vehicle cannot be unlocked if thereis no wakeup keyword or the image data does not match the stored figureimage data. Therefore, security of this solution is relatively high.

It should be noted that the vehicle unlocking apparatus provided in theforegoing embodiments is described only using division into theforegoing function modules. In practice, the functions may be allocatedto different function modules for implementation as required. To bespecific, an internal structure of an apparatus is divided intodifferent function modules to implement all or some of the functionsdescribed above. In addition, the vehicle unlocking apparatus providedin the foregoing embodiments and the vehicle unlocking method embodimentbelong to a same idea. For a specific implementation process, refer tothe method embodiment, and details are not described herein again.

An embodiment of this application provides a vehicle unlocking system.The system includes a vehicle unlocking apparatus and at least onecamera. The vehicle unlocking apparatus may be the in-vehicleinfotainment in FIG. 1 .

The vehicle unlocking apparatus may obtain a voice signal, and determinean acoustic source direction based on the voice signal if the voicesignal includes a wakeup keyword, where the wakeup keyword is used toindicate that a vehicle currently needs to be unlocked. Then, thevehicle unlocking apparatus may send an instruction to a target camerabased on the acoustic source direction, where the target camera is acamera that is in at least one camera installed on the vehicle and whoseangle of view can ensure that the acoustic source direction is captured.The target camera may collect image data according to the instructionsent by the vehicle unlocking apparatus. Then, the vehicle unlockingapparatus may unlock the vehicle based on the received image datacollected by the target camera.

An embodiment of this application provides a vehicle unlockingapparatus. The vehicle unlocking apparatus may be the in-vehicleinfotainment in FIG. 1 , and the apparatus includes a memory and aprocessor.

The memory stores a wakeup keyword, where the wakeup keyword is used toindicate that a vehicle currently needs to be unlocked.

The processor may obtain a voice signal, and determine an acousticsource direction based on the voice signal if the voice signal includesthe wakeup keyword. Then, the processor may further indicate, based onthe acoustic source direction, a target camera to collect image data,where the target camera is a camera that is installed on the vehicle andwhose angle of view can ensure that the acoustic source direction iscaptured. The processor may further receive the image data collected bythe target camera, and unlock the vehicle based on the image data.

An embodiment of this application provides a vehicle. The vehicleincludes a memory, a processor, and at least one camera. The vehicle maybe the vehicle in FIG. 1 .

The memory stores a wakeup keyword, where the wakeup keyword is used toindicate that the vehicle currently needs to be unlocked.

The processor may obtain a voice signal, and if the voice signalincludes the wakeup keyword, the processor may determine an acousticsource direction based on the voice signal. Then, the processor mayfurther select a target camera from the at least one camera based on theacoustic source direction, and send an instruction to the target camera,where the target camera is a camera that is installed on the vehicle andwhose angle of view can ensure that the acoustic source direction iscaptured. The target camera may collect image data according to theinstruction sent by the controller. Then, the processor may furtherreceive the image data collected by the target camera, and unlock thevehicle based on the image data.

It should be noted that the vehicle unlocking system, the vehicleunlocking apparatus, and the vehicle that are provided in the foregoingembodiments and the vehicle unlocking method embodiment belong to a sameidea. For specific implementation processes, refer to the methodembodiment, and details are not described herein again.

In addition, in the vehicle unlocking system, the vehicle unlockingapparatus, and the vehicle that are provided in the foregoingembodiments, first, wakeup keyword detection is performed on theobtained voice signal. If the voice signal includes the wakeup keyword,in other words, the wakeup keyword detection succeeds, the acousticsource direction is determined, and the target camera is indicated tocollect the image data. The target camera is a target camera that isinstalled on the vehicle and whose angle of view can ensure that theacoustic source direction is captured. Then, the vehicle is unlockedbased on the image data collected by the target camera. Compared with amethod in which a user manually enters a password in a relatedtechnology, a process of performing the wakeup keyword detection on thevoice signal and unlocking the vehicle based on the image data in thissolution takes a relatively short time, in other words, a vehicleunlocking speed in this solution is relatively high. In addition, inthis solution, the vehicle is unlocked based on the wakeup keyword andthe image data, and the vehicle cannot be unlocked if there is no wakeupkeyword or the image data does not match the stored figure image data.Therefore, security of this solution is relatively high.

All or some of the foregoing embodiments may be implemented by software,hardware, firmware, or any combination thereof. When software is used toimplement the embodiments, the embodiments may be implemented completelyor partially in a form of a computer program product. The computerprogram product includes one or more computer instructions. When thecomputer instructions are loaded and executed on the computer, theprocedure or functions according to the embodiments of this applicationare all or partially generated. The computer may be a general-purposecomputer, a dedicated computer, a computer network, or otherprogrammable apparatuses. The computer instructions may be stored in acomputer readable storage medium or may be transmitted from a computerreadable storage medium to another computer readable storage medium. Forexample, the computer instructions may be transmitted from a website,computer, server, or data center to another website, computer, server,or data center in a wired (for example, a coaxial cable, an opticalfiber, or a digital subscriber line (DSL)) or wireless (for example,infrared, radio, or microwave) manner. The computer readable storagemedium may be any usable medium accessible by a computer, or a datastorage device, such as a server or a data center, integrating one ormore usable media. The usable medium may be a magnetic medium (forexample, a floppy disk, a hard disk, or a magnetic tape), an opticalmedium (for example, a digital versatile disc (DVD), a semiconductormedium (for example, a solid-state drive (SSD)), or the like. It shouldbe noted that the computer readable storage medium in this applicationmay be a non-volatile storage medium, in other words, may be anon-transitory storage medium.

The foregoing descriptions are merely embodiments of this application,but are not intended to limit this application. Any modification,equivalent replacement, or improvement made without departing from thespirit and principle of this application should fall within theprotection scope of this application.

A person of ordinary skill in the art may understand that all or some ofthe steps of the embodiments may be implemented by hardware or a programinstructing related hardware. The program may be stored in a computerreadable storage medium. The storage medium may include: a read-onlymemory, a magnetic disk, or an optical disc.

The foregoing descriptions are merely example embodiments of thisapplication, but are not intended to limit this application. Anymodification, equivalent replacement, or improvement made withoutdeparting from the spirit and principle of this application should fallwithin the protection scope of this application.

What is claimed is:
 1. A vehicle unlocking method, wherein the vehicleunlocking method comprises: obtaining a voice signal; determining anacoustic source direction based on the voice signal in response todetermining that the voice signal comprises a wakeup keyword, whereinthe wakeup keyword indicates that a vehicle currently needs to beunlocked; indicating, based on the acoustic source direction, a targetcamera installed on the vehicle to collect image data, wherein an angleof view of the target camera is configured to capture the acousticsource direction; and unlocking the vehicle based on the image datacollected by the target camera.
 2. The vehicle unlocking methodaccording to claim 1, wherein the indicating, based on the acousticsource direction, a target camera to collect image data comprises:selecting, based on the acoustic source direction and from a pluralityof cameras installed on the vehicle, a camera, wherein an angle of viewof the camera is configured to capture the acoustic source direction;determining the camera that is selected as the target camera; andindicating, based on the acoustic source direction, the target camera tocollect the image data.
 3. The vehicle unlocking method according toclaim 1, wherein the unlocking the vehicle based on the image datacollected by the target camera comprises: matching the image datacollected by the target camera with stored figure image data; andunlocking the vehicle in response to determining that the image datacollected by the target camera matches the stored figure image data. 4.The vehicle unlocking method according to claim 3, wherein: the storedfigure image data comprises a facial feature; the matching the imagedata collected by the target camera with stored figure image datacomprises: extracting a facial feature of a figure recognized in theimage data; and the unlocking the vehicle in response to determiningthat the image data collected by the target camera matches the storedfigure image data comprises: unlocking the vehicle in response todetermining that the facial feature of the figure recognized in theimage data matches any stored facial feature.
 5. The vehicle unlockingmethod according to claim 3, wherein: the stored figure image datacomprises a facial feature; the matching the image data collected by thetarget camera with stored figure image data comprises: extracting afacial feature of a figure that is recognized in the image data and thatis located in the acoustic source direction; and the unlocking thevehicle in response to determining that the image data collected by thetarget camera matches the stored figure image data comprises: unlockingthe vehicle in response to determining that the extracted facial featurematches any stored facial feature.
 6. The vehicle unlocking methodaccording to claim 5, wherein the extracting a facial feature of afigure that is recognized in the image data and that is located in theacoustic source direction comprises: determining an image region fromthe image data based on the acoustic source direction; and extracting afacial feature of a figure recognized in the image region.
 7. Thevehicle unlocking method according to claim 6, wherein the determiningan image region from the image data based on the acoustic sourcedirection comprises: determining a relative position of the acousticsource direction within the angle of view of the target camera;determining a reference position on the image data based on the relativeposition; and extending the reference position on the image data to eachof a left side and a right side by a reference neighborhood range toobtain the image region.
 8. The vehicle unlocking method according toclaim 3, wherein: the image data collected by the target camera is avideo, and the stored figure image data comprises a gait feature; thematching the image data collected by the target camera with storedfigure image data comprises: extracting a gait feature of a figurerecognized in the video; and the unlocking the vehicle in response todetermining that the image data collected by the target camera matchesthe stored figure image data comprises: unlocking the vehicle inresponse to determining that the gait feature of the figure recognizedin the image data matches any stored gait feature.
 9. The vehicleunlocking method according to claim 3, wherein: the image data collectedby the target camera is a video, and the stored figure image datacomprises a gait feature; the matching the image data collected by thetarget camera with stored figure image data comprises: extracting afacial feature of a figure recognized in the image data; determining, asa target figure, a recognized figure whose facial feature matches anystored facial feature; extracting a gait feature of the target figurerecognized in the image data; and unlocking the vehicle if the gaitfeature of the target figure matches any stored gait feature.
 10. Thevehicle unlocking method according to claim 1, wherein the determiningan acoustic source direction based on the voice signal in response todetermining that the voice signal comprises a wakeup keyword comprises:in response to determining that the voice signal comprises the wakeupkeyword, extracting a voiceprint feature in the voice signal; and inresponse to determining that the extracted voiceprint feature matchesany stored voiceprint feature, determining the acoustic source directionbased on the voice signal.
 11. A vehicle unlocking apparatus, whereinthe vehicle unlocking apparatus comprises one or more memories and atleast one processor, wherein: the one or more memories are configured tostore a wakeup keyword, wherein the wakeup keyword is used to indicatethat a vehicle currently needs to be unlocked; and the one or morememories store program instructions for execution by the at least oneprocessor to: obtain a voice signal; determine an acoustic sourcedirection based on the voice signal in response to determining that thevoice signal comprises the wakeup keyword; indicate, based on theacoustic source direction, a target camera installed on the vehicle tocollect image data, wherein an angle of view of the target camera isconfigured to capture the acoustic source direction; and unlock thevehicle based on the image data collected by the target camera.
 12. Thevehicle unlocking apparatus according to claim 11, wherein the one ormore memories store the program instructions for execution by the atleast one processor to: select, based on the acoustic source directionand from a plurality of cameras installed on the vehicle, a camera,wherein an angle of view of the camera is configured to capture theacoustic source direction; and determine the camera that is selected asthe target camera.
 13. The vehicle unlocking apparatus according toclaim 11, wherein unlocking the vehicle based on the image datacollected by the target camera comprises: matching the image datacollected by the target camera with stored figure image data; andunlocking the vehicle in response to determining that the image datacollected by the target camera matches the stored figure image data. 14.The vehicle unlocking apparatus according to claim 13, wherein: thestored figure image data comprises a facial feature; matching the imagedata collected by the target camera with the stored figure image datacomprises: extracting a facial feature of a figure recognized in theimage data; and unlocking the vehicle in response to determining thatthe image data collected by the target camera matches the stored figureimage data comprises: unlocking the vehicle in response to determiningthat the facial feature of the figure recognized in the image datamatches any stored facial feature.
 15. The vehicle unlocking apparatusaccording to claim 13, wherein: the stored figure image data comprises afacial feature; matching the image data collected by the target camerawith the stored figure image data comprises: extracting a facial featureof a figure that is recognized in the image data and that is located inthe acoustic source direction; and unlocking the vehicle in response todetermining that the image data collected by the target camera matchesthe stored figure image data comprises: unlocking the vehicle inresponse to determining that the extracted facial feature matches anystored facial feature.
 16. The vehicle unlocking apparatus according toclaim 15, wherein extracting the facial feature of the figure that isrecognized in the image data and that is located in the acoustic sourcedirection comprises: determining an image region from the image databased on the acoustic source direction; and extracting a facial featureof a figure recognized in the image region.
 17. A non-transitorycomputer-readable storage medium, wherein the non-transitorycomputer-readable storage medium stores program instructions forexecution by at least one processor to: obtain a voice signal; determinean acoustic source direction based on the voice signal in response todetermining that the voice signal comprises a wakeup keyword, whereinthe wakeup keyword indicates that a vehicle currently needs to beunlocked; indicate, based on the acoustic source direction, a targetcamera installed on the vehicle to collect image data, wherein an angleof view of the target camera is configured to capture the acousticsource direction; and unlock the vehicle based on the image datacollected by the target camera.
 18. The non-transitory computer-readablestorage medium according to claim 17, wherein indicating, based on theacoustic source direction, the target camera to collect the image datacomprises: selecting, based on the acoustic source direction and from aplurality of cameras installed on the vehicle, a camera, wherein anangle of view of the target camera is configured to capture the acousticsource direction; determining the camera that is selected as the targetcamera; and indicating, based on the acoustic source direction, thetarget camera to collect the image data.
 19. The non-transitorycomputer-readable storage medium according to claim 17, whereinunlocking the vehicle based on the image data collected by the targetcamera comprises: matching the image data collected by the target camerawith stored figure image data; and unlocking the vehicle in response todetermining that the image data collected by the target camera matchesthe stored figure image data.
 20. The non-transitory computer-readablestorage medium according to claim 19, wherein: the stored figure imagedata comprises a facial feature; matching the image data collected bythe target camera with the stored figure image data comprises:extracting a facial feature of a figure recognized in the image data;and unlocking the vehicle in response to determining that the image datacollected by the target camera matches the stored figure image datacomprises: unlocking the vehicle in response to determining that thefacial feature of the figure recognized in the image data matches anystored facial feature.